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Abstract. It has been suggested that recent cosmological and flavor-oscillation data favor the ex- 
istence of additional neutrino species beyond the three predicted by the Standard Model of particle 
physics. We apply Bayesian model selection to determine whether there is indeed any evidence from 
current cosmological datasets for the standard cosmological model to be extended to include addi- 
tional neutrino flavors. The datasets employed include cosmic microwave background temperature, 
polarization and lensing power spectra, and measurements of the baryon acoustic oscillation scale and 
the Hubble constant. We also consider other extensions to the standard neutrino model, such as mas- 
sive neutrinos, and possible degeneracies with other cosmological parameters. The Bayesian evidence 
indicates that current cosmological data do not require any non-standard neutrino properties. 
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1 Introduction 

In the past decade there has been great progress in neutrino physics: in particular, significant evidence 
has been found for non-zero neutrino masses from observations of neutrino oscillations. The Standard 
Model of particle physics has three massless neutrinos, and thus massive neutrinos require beyond-the- 
Standard-Model physics or uncomfortable fine-tuning. When considering extensions of the Standard 
Model, it is not unreasonable to explore the possibility of more than three neutrino species. This 
subject has recently received renewed attention (see Ref. [1] and references therein for a thorough 
review): in particular, extra sterile neutrinos have been invoked to explain recent anomalies observed 
in neutrino experiments, most notably results from the LSND and MiniBooNe experiments [2-5]. 

The expansion history of the early Universe depends on the physical energy density in relativistic 
particles, which can be expressed in terms of the energy density in photons (highly constrained by 
the measurement of the cosmic microwave background (CMB) temperature) and an effective number 
of neutrino species N^. 1 Cosmology therefore offers two windows to constrain N c g: nucleosynthesis, 
where the abundances of the light elements are driven by the Hubble rate, and large-scale structures 
- in particular the CMB - which probe the radiation density at matter-radiation equality and at 
recombination. 

A deviation from the Standard Model prediction for N c s is not necessarily evidence for new 
neutrino physics: any unorthodox early-Universe expansion history, for example due to the presence 
of non-standard energy density (dubbed "dark radiation"), can be parametrized as a deviation from 
the expected number of neutrino species. 2 Limits on possible deviations from N c g — 3.046 therefore 
place constraints on neutrino physics and on any other process that changes the expansion history [11- 
18]. In this work we concentrate on constraints from the CMB and large-scale structure, phrasing 
the results in terms of neutrinos rather than dark radiation. Current cosmological evidence for extra 
neutrinos from CMB and large-scale structure observations is discussed in Refs. [1, 19-31]; however, 
see Refs. [32-34] for a different interpretation. 

Given the fact that the Standard Model firmly predicts three neutrino families, and that dark 
radiation would signal that the standard cosmological model, ACDM, is incorrect, the issue of whether 
cosmology favors extra neutrino species is an issue of model selection rather than parameter estimation. 
In model selection, which must be performed within the Bayesian framework to be self-consistent [35] , 

In the Standard Model N c g = 3.046, differing from the number of neutrino species N„ = 3 to account for QED 
effects, for neutrinos being not completely decoupled during electron-positron annihilation and other small effects, see 
e.g. Refs. [6, 7]. 

2 Provided the extra radiation responsible for changing the expansion is free-streaming: interacting dark radiation 
(and, indeed, neutrinos) impart different signatures (see Refs. [8—10]). 
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the statistic used to select between models is the Evidence, E. The Evidence is the "model-averaged 
likelihood" - i.e. the integral of the likelihood over the parameter volume, weighted by the prior - and 
thus encodes the full predictive power of the model. The Evidence has arbitrary normalization and 
is thus not useful in its own right; however, the ratio of the Evidence values for two models given the 
same data expresses the relative odds that these models are responsible for the observed state of the 
Universe. We capitalize the Bayesian Evidence where used to help distinguish it from the colloquial 
"evidence" . 

While neutrinos have mass, the standard cosmological model traditionally contains massless 
neutrinos under the assumption that neutrino masses are too small to leave a measurable effect on 
cosmological observables. However, forthcoming cosmological data will have enough statistical power 
to detect the signature of non-zero neutrino masses, even if they are close to the lower limit allowed by 
oscillation experiments [36, 37]; they have the potential to constrain the sum of the neutrino masses, 
M v = " m„. We therefore also consider the Evidence for models with massive neutrinos, as well as 
possible degeneracies of neutrino properties with other cosmological parameters. Recent constraints 
on M v from a range of cosmological datasets include Refs. [38-41]. 

The paper is organized as follows. In Sec. 2 we review the Bayesian Evidence, its interpreta- 
tion, calculation and alternatives, and introduce the datasets and models we explore. In Sec. 3 we 
present our results. We compute standard Bayesian parameter estimates and prior-independent pro- 
file likelihood ratios in addition to Evidence ratios, which are presented for different model-dataset 
combinations. We are therefore able to compare our model-selection results to the parameter esti- 
mates traditionally used in the literature, as well as investigating the effects of the choice of priors on 
our findings. Finally, we conclude in Sec. 4. 

2 Method and Datasets 

The majority of neutrino analyses to-date employ parameter estimation to assess the need for addi- 
tional neutrino physics, using different combinations of datasets to determine the most likely values 
of TVeff and M v . These analyses are typically carried out in the Bayesian framework (see Ref. [1] for a 
review) but there are exceptions, such as Refs. [32, 33]. While such analyses provide estimates of the 
most likely parameter values given a particular model of neutrino physics, they are unable to provide 
a comparison of the relative probabilities of different models. 

A deviation from the standard value of N e g would signify that one or both of the standard models 
of cosmology and particle physics are incorrect or incomplete. Extended models, in which the neutrino 
mass and/or number of species are allowed to vary from the standard values, should therefore be 
compared with the standard model using model selection. To perform model selection, we must work 
within the Bayesian framework, in which models are selected based upon their relative Evidence values; 
parameter constraints are also produced as a by-product of the Evidence calculation. To explore the 
prior-dependence of our results, we also report a frequentist approximation to model selection - the 
profile likelihood ratio - which relies only on the likelihood and is thus prior-independent. 

Our work is novel not only because we consider the Bayesian Evidence for additional neutrino 
physics, but also because we include measurements of weak gravitational lensing of the CMB by 
large-scale structure (see e.g. Ref. [42] for a review) in our analysis. The CMB lensing signal, 
first detected by the Atacama Cosmology [43] and South Pole [41] Telescopes, peaks at redshifts of 
~ 2 and is sensitive to the growth of structure. Adding the CMB lensing power spectrum to the 
temperature power spectrum therefore adds information about the late-time Universe, including the 
effect of massive neutrinos. 

2.1 Model Selection and Bayesian Evidence 

The Bayesian Evidence E for a model M describing a data vector d with an TV-dimensional set of 
parameters a is given by 




(2.1) 
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where Pr(d|a,M) is the likelihood (the probability of obtaining the data given the model and a 
particular set of parameter values) and Pt(a\M) is the prior probability on the model parameters. 
Eq. 2.1 is a multi-dimensional integral over a volume defined by the parameter ranges permitted by 
the prior: it is the probability of the obtaining the data given the full model, Pr(d|M), rather than 
given a particular set of parameters. The Evidence depends both on the height of the likelihood and 
the total volume of the parameter space: thus a model in which the likelihood is significantly non-zero 
over a large fraction of the permitted parameter volume is more predictive than a model where a 
small perturbation from the best- fit parameters yields a significant degradation in fit. 

The Evidence describes the probability of getting a set of measurements given a model; for model 
selection, we want to instead compare the probabilities of a pair of models being true given a set of 
data. Bayes' theorem [44] allows us to relate these quantities as 

Pr(M!|d) = Pr(Mi) E 1 

Pr(M 2 |d) Pt(M 2 )E 2 ' 1 ' ' 

where the a priori preference for model Mi over M 2 , Pt(M±) /Pr(M 2 ), is typically set to unity. Under 
this assumption, the Evidence ratio, Ei/E 2 , between two models yields the ratio of probabilities for 
the two models in the light of the data, or equivalently the relative "betting odds" for two models. 

Evidence ratios are often compared using the Jeffreys' scale [45] (in fact, a slightly modified 
version [46]), which rates | A In ii 1 1 = | ln(£^i/£^2)| < 1 as being "not worth a bare mention", whereas 
| A In E | > 5 is regarded as "highly significant". Values ofl < |Alni?| < 2.5 indicate "substantial" 
evidence for the model with higher E; values in the range 2.5 < |Alni?| < 5 indicate "strong" 
evidence. In terms of odds, "substantial" evidence corresponds roughly to 12:1; "strong" evidence to 
150:1. It is important to note that this scale is empirically calibrated and should be used only as a 
guide: strict adherence to it carries all of the risks inherent in the use of any thresholded scale. 

We use the publicly available camb [47] Boltzmann solver code to compute the theoretical 
CMB angular power spectra and BAO scale for different cosmologies, combined with a MultiNest- 
enabled [48, 49] version of the CosmoMC [50] package to compute the Bayesian Evidence and parameter 
estimates. We use 800 MultiNest live points and set the tolerance and efficiency parameters to their 
recommended values of 0.3 to ensure the accuracy and precision of the Evidence calculation. With 
these settings, the precision of the Evidence calculation is o~\ n E ~ 0.3. 

The Evidence is, by definition, sensitive to the prior choice. It is therefore extremely important 
to set the priors independently of the data used and, in some cases, to explore the prior dependence. 
We pay careful attention to the prior ranges in Section 2.4. 

2.2 Profile Likelihood Ratio 

The Bayesian Evidence provides a self-consistent framework with which to perform model selection, 
but, as with all Bayesian methods, it depends on the prior choice. In some cases the priors can 
be physically motivated, but when this is not possible the Bayesian answer to the model selection 
issue cannot be considered definitive. It is therefore interesting to consider statistics which rely only 
on the likelihood and are thus prior-independent, even if these do not provide a fully consistent 
model-selection criterion. Here we use the profile likelihood ratio (PLR) (see e.g. Refs. [32, 33, 51]). 
Assuming a model, M, has N — 1 uninteresting parameters a (which are marginalized over in the 
standard Bayesian approach) and one parameter, (3, on which we want to report constraints, the 
PLR is the ratio between the conditional maximum likelihood for a fixed value of (3 (say, /3») and the 
unconditional maximum likelihood: 

PTPfg ^- max[Pr(d|a,/3 = /3,,M)] 

PLK[P * } ~ max[Pr(d|a,/3,M)] ' (2 ' 3) 

In other words, the PLR is the ratio of the maximum likelihood for each value of j3 to the overall 
maximum likelihood (allowing the other parameters to take any permitted values). 

This quantity has an interpretation similar to the A^ 2 , where the effective chi-square is identified 
with — 21nPr(d|a,/?, M) and, by construction, is prior-independent. Assuming Gaussian statistics, 
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one may therefore associate the PLR values of 0.5 and 2 with one- and two-cr confidence limits. The 
PLR is particularly useful in testing nested models: cases where a more-complex model has a free 
parameter (/?) which is fixed in the simpler model. The extensions to ACDM considered in this work 
fall into this category. If the best-fit value for this parameter differs from the standard, nested value 
at > na one reports an n-a "evidence" for that parameter, and thus for the associated model. One 
must keep in mind that the confidence intervals may not have strict frequentist coverage, especially 
if the likelihood is far from Gaussian; nevertheless it allows us to investigate whether preferences for 
extra parameters in posterior confidence intervals are driven by the data or by the prior. 

Eq. 2.3 describes the ideal profile likelihood ratio, which would be calculable given an infinitely- 
fine sampling regime in the parameter j3. In practice, given the sampling frequencies accessible to 
nested sampling or MCMC methods, one must instead calculate a proxy to the PLR, where the 
complex model's likelihood is first binned in the parameter of interest. The PLR is then given by the 
ratio of these binned maximum likelihoods to the maximum likelihood overall: 

PTRffl x max[Pr(d|c*,/3 = /3,±A/3,M)] 

PLR( ^ } " max[Pr(d|a,/3,M)] " {2A} 

2.3 Datasets 

We consider the datasets listed below in our analysis. 

• Seven-year data from the Wilkinson Microwave Anisotropy Probe (WMAP) satellite [52]. In 
particular we consider the angular power spectra (Cg) of the temperature and E-mode polariza- 
tion signals and their cross-correlation. We use the likelihood routine released by the WMAP 
collaboration, which is available for download from the Legacy Archive for Microwave Back- 
ground Data Analysis (LAMBDA) website. 3 

• The South Pole Telescope (SPT) measurement of the damping tail of the CMB temperature Cg. 
We follow Ref. [53] in computing the SPT likelihood, setting £ max = 3000, and marginalizing over 
foreground contributions from unresolved point sources and Sunyaev-Zel'dovich (SZ) clusters. 

• The South Pole Telescope measurement of the signature of weak gravitational lensing of the 
CMB by large-scale structure, yielding an estimate of the power spectrum of the projected 
gravitational potential [41] (SPTLens). In the Appendix we describe in detail how this likelihood 
is computed. We make the Fortran likelihood code written for this work publicly available for 
download. 4 

• Measurements of the expansion history of the Universe at low redshifts (z < 1) using observations 
of the baryon acoustic oscillation scale (BAO) by WiggleZ [54] (at (z) = {0.44, 0.6, 0.73}) and 
the Sloan Digital Sky Survey [55] (at (z) = {0.2, 0.35}). 

• A measurement of Hq from Ref. [56] . 

The CMB temperature and lensing data we use are shown in the left-hand plots of Figs. 1 and 2. 5 
Note that to make the damping tail more visible we have plotted the angular temperature power 
spectrum in the form of £ 3 (£ + l)Ce/(2n). 

In our analysis we use WMAP data alone only in few reference cases, as the degeneracies be- 
tween N e g and other parameters are not resolved with WMAP data alone. We also use the combina- 
tions WMAP+SPT, WMAP+SPT+SPTLens, WMAP+SPT+SPTLens+BAO, and WMAP+SPT+ 
SPTLens+i/o- These datasets represent the state of the art circa mid 2012. Since then new cosmo- 
logical data - notably new WMAP [57], BAO [58], SPT [59] and Planck [60] results - have become 



3 http : //lambda. gsf c .nasa. gov/product/map/dr4/m_products . cf m 

4 http : //zuserver2 . star .ucl . ac .uk/~smf /spt_lensing_likelihood. tar . gz 

5 Note that the CMB lensing power spectra are plotted as a function of L, rather than I. As CMB lensing induces 
couplings between modes with different I values, the signal is estimated as a function of the vector L, the sum of two 
harmonic-space CMB modes: L = li + I2. The magnitude of this quantity, L, is still an inverse angular scale, and 
lensing power spectra can thus be interpreted in exactly the same way as temperature power spectra. 
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Figure 1. Left: CMB temperature power spectrum measurements from the WMAP satellite (black diamonds) 
and the South Pole Telescope (green squares). The SPT data include foreground contributions from Poisson- 
distributed and clustered point sources, as well as the kinetic and thermal Sunyaev-Zel'dovich effects; the scale 
of this contribution is indicated by the red solid line. To illustrate the degeneracy between N e g and O c /i 2 , 
power spectra for nearly-degenerate models with N B g equal to 3.046 (solid), 2.0 (dashed) and 5.0 (dot-dashed) 
are overlaid on the WMAP and SPT data. Right: the fractional differences of the power spectra with respect 
to ACDM are shown as a function of scale, with error bars from WMAP and SPT indicated by the dark- and 
light-grey regions, respectively. 




Figure 2. Left: CMB lensing data from the South Pole Telescope (black diamonds). Overlaid are power 
spectra for ACDM (solid) and for nearly-degenerate models with N c g equal to 4.0 (dotted), 5.0 (dashed), 5.8 
(dot-dashed) and 6.0 (triple-dot-dashed). Right: the fractional differences of the power spectra with respect 
to ACDM are shown as a function of scale, with SPT error bars indicated by the grey shaded region. 



public, which we have not included for several reasons. The first, and purely practical, consideration 
is that each Evidence run takes a significant amount of time, and one is therefore forced to "freeze" 
the data considered early in the analysis. Furthermore: 

• The WMAP9 data are fully in agreement with the previous release and, coupled with the fact 
that we use WMAP data alone in only a few selected cases, the difference between using WMAP7 
and WMAP9 data is negligible. 

• The impact of BAO data on our findings is, at the moment, only marginal, as already noted in 
Ref. [26]. We therefore argue that the increased statistical power of new BAO data, especially 
in combination with other datasets, will not significantly change our findings. Moreover, as the 
authors of the relevant papers point out (in, for example, Sec. 8 of Ref. [58]), there seems to be 
a hint of tension between BAO and CMB data, which we will discuss further below. 
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• The new SPT data were released late in the process of writing this manuscript, and also exhibit 
a mild tension with other datasets [61]. Since the source of this tension is unknown at present, 
and systematics may potentially play a role, we do not include these new data in this work. 

• The Planck data were released after this work was submitted for publication. An analysis 
including the Planck data will be presented in forthcoming work [62] . 

2.4 Models and Priors 

The models considered in this work are listed below. In all cases, the geometry is assumed to be flat. 

• A baseline power-law ACDM model with three massless neutrinos (N e s = 3.046): ACDM. 

• A power-law ACDM model with massless neutrinos in which 7V e ff is allowed to vary: ACDMA e g . 

• A power-law ACDM model with three neutrinos with non-zero mass: ACDMM,,. 

• A power-law ACDM model with massive neutrinos in which N c g is allowed to vary: ACDMM„iV e ff . 

• A power-law ACDM model with massless neutrinos in which N e g is allowed to vary and the 
power-law spectral index is allowed to run: aACDMA c ff . 

We use a uniform prior on the following parameters: the physical cold dark matter density, 
£l c h 2 ; the physical baryon density, r2&/i 2 ; exp(— 2r) (r being the optical depth to the last scattering 
surface); the power spectrum slope, n s ; the running of the spectral index, a — dn s /dhik, where k is 
the wavenumber; and the log of the scalar amplitude, log A s . For these parameters the range of the 
prior does not affect the Evidence calculation for neutrino properties as long as the prior encompasses 
the region where the likelihood is significant, which is the case here. We also use a uniform prior 
on the parameter 9 because it roughly linearizes the dependence of the Ces on the cosmological 
parameters [63], and is therefore a good sampling parameter. In standard ACDM or oACDM, 9 is the 
the ratio of the sound horizon to the angular diameter distance at decoupling. CosmoMC computes this 
quantity approximately, using the fitting formula of Ref. [64] to compute the redshift of decoupling, 
which is not valid for models with, for example, non-standard neutrinos. The approximation for the 
redshift of decoupling is good to 4% in models with non-standard neutrinos. Thus in such models the 
parameter 9 is only approximately the ratio of the sound horizon to the angular diameter distance. 
The actual calculation for CMB anisotropies is done correctly using the parameter Hq: by sampling 
uniformly in the parameter 9 computed with the approximation from Ref. [64] one is effectively using 
a slightly non-uniform prior on the true ratio of the sound horizon to the angular diameter distance. 
Some authors have avoided this problem by sampling uniformly in Ho. However, Hq and 9 are heavily 
degenerate with, for example, N e g and so the priors imposed on these parameters directly affect the 
calculation of the Evidence. 

We assume a uniform prior on N e g in the range 1.047 < N e s < 9.0. The upper limit is chosen 
to match that used in the SPT analysis of Ref. [41]. The lower limit is imposed by the fact that the 
approximation used by CosmoMC to calculate the redshift of decoupling [64] breaks down at this point. 
In the majority of cases, the likelihood is effectively zero near the minimum value of N c s considered, 
and no contribution to the Evidence is therefore missed. Truncating the N e s prior at 1.047 instead 
of zero increases the log-Evidence (via the reduction in prior volume) by 0.12: as this is smaller than 
the typical sampling precision in calculating the log-Evidence (0.3) we ignore this effect. 

The variation of M v is achieved by sampling the fraction, /„, of the dark matter that is in 
neutrinos [65] 

3v fl c h 2 0^ 2 94eV 1 ' 

We assume a uniform prior on f v in the range < f v < 0.43. The lower limit corresponds to massless 
neutrinos; the upper limit is the maximum contribution neutrinos would make to the dark matter 

6 The factor of 94 eV relates the energy density and total mass of N v neutrino species with a common temperature. 
When simultaneously sampling JV e g and /„, M„ is only strictly the sum of the neutrino masses if the extra neutrinos 
are assumed to be at the same temperature as the Standard Model neutrinos. 
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Table 1. Summary of the combinations of models and datasets for which the Evidence is calculated (E). For 
the abbreviations used, see the main text. 



assuming M u ~ 2 eV and the minimum physical dark matter density allowed by the prior. The priors 
we adopt on f v and Vt c h 2 allow the neutrino mass to range between 0.0 < M v < 8 eV, slightly larger 
than the upper limit given by anti-electron- neutrino mass measurements [66] . 

In Table 1 we present the combinations of datasets and models we consider. 

3 Results 

We begin by presenting the parameter constraints, then move to model selection and report the 
Evidence for deviations from Standard Model neutrinos and the profile likelihood ratio. 

3.1 Parameter Estimation 

Parameter estimates for the various models and data combinations considered are shown in Table 2. 
Constraints on the summed neutrino mass, M v , are presented as 95% confidence upper limits; all 
other parameter estimates are mean-posterior values with 68% confidence-limit errors. Concentrating 
first on the ACDMiV e ff model, we see that, for the majority of data combinations, the mean posterior 
value of Nes is higher than the Standard Model prediction by approximately 1.5a. We note that 
the SPT lensing data do not contribute significantly to the constraints. While in principle the CMB 
lensing signal is sensitive to cosmological parameters such as N B g and M„, the use of the signal is 
in its infancy, having been detected for the first time only in the last year or so. The error bars are 
still too large to contribute significantly to parameter constraints, but a reduction of the error-bar 
magnitude by a factor of few, especially on large-scales (L < 500), should change this (as can be 
appreciated from the right-hand panel of Fig. 2). Increased sky coverage could easily reduce the error 
bars by the required factor. 

Increasing the amount of relativistic matter in the early Universe affects the expansion history, 
and in particular, the redshift of matter-radiation equality, z cq . The effects of additional massless 
neutrinos - or equivalently dark radiation - are therefore degenerate with parameters such as the 
physical dark matter density, fl c h 2 . CMB temperature power spectra for two cosmologies with the 
same 1 + z cq as ACDM but non-standard N e g are plotted in Fig. 1 alongside the WMAP and SPT 
measurements and errors. The raw power spectra (left) are nearly indistinguishable by eye; the 
differences are only apparent from the fractional difference plot (right), in which the measurement 
errors are plotted as grey shaded regions: the power spectra are all compatible with CMB experiments 
to-date. Equivalent plots for the CMB lensing power spectra are shown in Fig. 2. 

The degeneracy between jV e ff and fl c h 2 is also clear to see in the parameter constraints plotted in 
Fig. 3 for all dataset combinations considered. Marginalizing over this degeneracy, which extends to 
large values of N c g, will naturally bias the posterior- mean of N e g to larger values. Adding the extra 
data considered constrains the cosmology better, narrowing and reducing the length of the degeneracy 
but not breaking it. The marginalized constraint on N e g is therefore not enough to claim evidence for 
extensions to the Standard Model. We instead turn to the Bayesian Evidence and profile likelihood 
ratio (in Sections 3.2 and 3.3, respectively) to determine whether there is any convincing evidence 
for a non-standard number of neutrino species. It should (and, of course, has) been noted that Fiji 2 
is not the only parameter with which N e g is degenerate: the helium fraction, Yfjei is another such 
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3.8 + 0.6 
4.0 + 0.5 


< 1.3 

< 1.3 

< 0.7 

< 0.7 




aACDMAW 
















WMAP+SPT 
WMAP+SPT+SPTLens 


0.12 + 0.01 
0.12 + 0.01 


69 + 6 
69 + 6 


25 + 1 
25 + 1 


0.83 + 0.05 
0.83 + 0.04 


3.0 + 0.9 
3.0 + 0.8 




-0.02 ± 0.02 
-0.02 + 0.02 



Table 2. Parameter-estimation results for the various model-dataset combinations. All parameter estimates 
are presented as mean-posterior values with 68% CL errors, apart from the constraints on the summed neutrino 
mass, M v , which are 95% confidence upper limits. Almost all dataset combinations favor N c a > 3.046 at the 
~ 1.5cr level, unless the primordial power spectrum is allowed to run (aACDM). Degeneracies between AW 
and Q, c h 2 , Ho, Yu e and a, and between M v and as, are clear from the increased uncertainty in the standard 
cosmological parameters when the relevant neutrino parameter is allowed to vary. 

parameter [26], and Hq is another. The degeneracy between these parameters is plotted in Figs. 3 
and 4, and is clearly indicated by the estimated errors on Ye e and H increasing when N c g is allowed 
to vary (see Table 2). 

Parameter constraints for the models featuring a standard and non-standard number of massive 
neutrinos are also presented in Table 2. In all cases M v = is allowed within one-er, and the results 
are therefore presented as two-cr upper limits. Neutrinos with masses of the magnitude allowed by 
CMB data (i.e. M v < 1 eV) are relativistic when the CMB decouples, becoming non-relativistic at 
later times. While still relativistic, massive neutrinos free-stream out of, and hence damp, small-scale 
perturbations [67, 68]. With imperfect measurements, we therefore expect M v to be degenerate with 
as, the present linear-scale mass dispersion on a scale of 8ft- -1 Mpc. As the CMB measures the power 
spectrum before the massive neutrinos have imparted their full suppression, a% and M v are anti- 
correlated: the greater the contribution in massive neutrinos, the lower the present power spectrum 
at small scales predicted by the CMB. This degeneracy is clearly visible in Fig. 4, and could be broken 
with an independent measurement of the small-scale power spectrum. 

The final model we consider is a model with a non-standard number of massless neutrino species 
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Figure 3. The 68% and 95% joint confidence limits on the posterior probability distributions of N e s 
and Q, c h 2 (left), and N cft and H (right), given the WMAP (clear), WMAP+SPT+SPTLens (blue), 
WMAP+SPT+SPTLens+BAO (green) and WMAP+SPT+SPTLens+iio (red) datasets. Constraints from 
the WMAP+SPT datasets are not shown as they are very similar to results from the WMAP+SPT+SPTLens 
combination. The standard value of N e s is indicated by the dot-dashed line. 




Figure 4. The 68% and 95% joint confidence limits on the posterior probability distributions of N e s and 
Yhc, and M v (plotted in units of eV) and cr 8 , given the WMAP (clear), WMAP+SPT+SPTLens (blue), 
WMAP+SPT+SPTLens+BAO (green) and WMAP+SPT+SPTLens+# (red) datasets. The standard value 
of Nes is indicated by the dot-dashed line. 



in which the spectral index of the scalar perturbations is allowed to run. As running can mimic the 
effects of additional neutrinos by suppressing small-scale power, these parameters are degenerate, as 
indicated by the individual (see Table 2) and joint (see Fig. 5) parameter constraints. Nevertheless, 
the standard values of N e g and a are allowed at the one-sigma level, and so there is no evidence from 
parameter constraints to support this model. 



3.2 Model Selection 

The results of our Evidence calculations for each model-dataset combination are presented in Fig. 6. 
For each data combination, we use the Evidence for the basic ACDM model as the baseline for 
comparing models: the values plotted in Fig. 6 are 



AlnE = In 



Pr(d|ACDM) 
Pr(d|M) 



(3.1) 
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Figure 5. The 68% and 95% joint confidence limits on the posterior probability distributions of N e g and a, 
given the WMAP+SPT+SPTLens (blue) datasets. The standard values of N e g and a are indicated by the 
dot-dashed lines. 

where M is the particular extension to ACDM we are considering. A negative log-Evidence ratio 
would therefore indicate support for the more-complex model. 

The immediate observation from Fig. 6 is that the Evidence is, in the majority of cases, substan- 
tially in favor of vanilla ACDM. This suggests that there is not yet any evidence from cosmological 
data for additional neutrino species or massive neutrinos. This is particularly interesting in the case 
of the ACDMAf„ model, which has arguably the strongest physical motivation: neutrino oscillation 
measurements require neutrinos to have mass. Though the prior used in this case is somewhat ad 
hoc - the limits are well-motivated, but its uniform distribution is a choice - even by "cheating" 
(in a Bayesian sense at least, by using the data twice) and setting the maximum M v value to that 
allowed by previous WMAP analyses the Ami? values would only be reduced by ~ In 4 ~ 1.4. This 
is sufficient in some cases to tip the balance in favor of ACDMM F , but not with any significance. 

3.3 Profile Likelihood Ratio 

The priors applied to the additional neutrino parameters N e g and M v are, to a greater or lesser 
extent, phenomenologically-motivated, insofar as they are not derived from fundamental physical 
considerations. As we cannot therefore be confident that these priors are entirely appropriate, we 
employ the profile likelihood ratio (PLR) as a prior-independent, if not entirely self-consistent, model 
selection criterion. We concentrate on the ACDMiV e g model, as this is the only model for which the 
parameter estimates and Evidence ratios are in apparent disagreement. 

The PLR for N e g assuming the ACDMiV e g model is plotted in Fig. 7. To generate these plots, 
the PLR is calculated as described in Eq. 2.4, identifying a with the standard parameters of ACDM 
and /3 with N Q g, and using bins of width AN c g = 0.5. For illustrative purposes, we have chosen to 
normalize with respect to the maximum likelihood found for pure ACDM. Estimates of the scatter on 
the PLR introduced by sampling are obtained by first finding the difference between the maximum 
ACDM likelihood and the maximum ACDMiV c ff likelihood in a small bin of width 0.01 centered on 
N c g = 3.046. The error on each PLR bin is then estimated by weighting this difference by the square 
root of the ratio of the number of samples in the PLR bin to the number of samples in the small 
N cS ~ 3.046 bin. 

For all dataset combinations, the standard value of N e g is well within two-cr of the maximum 
likelihood recorded: indeed, taking into account the sampling-induced scatter, the difference is on 
the one-cr level for all but the SPT+WMAP datasets. This is in good agreement with the posterior 
parameter constraints, and indicates at most a mild preference in the data for a non-standard value 
of N e g. This preference is not large enough to push the Bayesian Evidence in favor of the extended 
model. 
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Figure 6. Evidence ratios for the various extended models compared to the simple ACDM power-law model, 
for each dataset combination. Top left: evidence for ACDM with a non-standard effective number of neutrino 
species (i.e. N a g 7^ 3.046). Top right: evidence for adding non-zero total neutrino mass (M„) to ACDM. 
Bottom left: evidence for adding both a non-zero neutrino mass and a non-standard effective number of 
neutrino species to ACDM. Bottom right: evidence for adding a non-standard effective number of neutrino 
species to ACDM and allowing a running (a) of the scalar spectral index. For all models, and all dataset 
combinations, the Evidence ratio is either inconclusive or substantially favors the simpler model, ACDM. The 
data do not favor the addition of the extra parameters considered in this analysis. 



4 Conclusions 

Recent cosmological and particle physics results have suggested that the current model of the neutrino 
should be extended to include additional species and/or mass. In this work, we have applied Bayesian 
model selection to determine whether current cosmological datasets truly favor models with extended 
neutrino physics over the standard picture. The datasets considered include CMB temperature and 
polarization power spectra, measurements of the BAO scale and the Hubble constant, as well as an 
integrated measurement of the growth of structure in the form of the CMB lensing power spectrum. 
The model selection results are compared to traditional (parameter estimation) and prior-independent 
(profile likelihood ratio) methods to understand the current findings and explore the prior-dependence 
of our conclusions. 

Although parameter estimates for the effective number of neutrino species, N e g , are higher than 
the standard value, our Bayesian model selection results show that current cosmological data favor 
the standard cosmological model over one with additional neutrino species. The high estimate of 
iV e ff is likely due to marginalization over a degeneracy with cosmological parameters [33], a finding 
corroborated by the profile likelihood ratio. Similarly for the sum of neutrino masses, M v , we find 
that the current cosmological data prefer ACDM over ACDM with massive neutrinos. We also note 
that the SPT CMB lensing data are not precise enough to strongly impact our findings; however, 
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Figure 7. Profile likelihood ratios comparing vanilla ACDM to ACDM with a non-standard effec- 
tive number of neutrino species. Results are shown for the WMAP (top left), WMAP+SPT and 
WMAP+SPT+SPTLens (top right, solid and dashed lines, respectively), WMAP+SPT+SPTLens+BAO 
(bottom left) and WMAP+SPT+SPTLens+i/o (bottom right) dataset combinations. Estimates on the un- 
certainty of the PLR values are plotted; the standard value of AW is indicated by the dot-dashed line. 



the increased sky coverage provided by Planck - publicly released after this work was submitted 
for publication - now allows CMB lensing data to play a critical role in defining our model of the 
Universe [69] . An analysis including the newly released Planck data will be presented in a forthcoming 
paper [62]. 
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SPT Lensing Likelihood 

The SPT lensing likelihood is a Gaussian likelihood based upon the lensing potential power spectrum, 
C* = L^Cff, following the camb naming convention. Recall that L is the magnitude of the sum of 
two CMB modes in harmonic space: L = |li + 12 1 - SPT measures the power spectrum in bands, B, of 
width AL = 100, centered onL = {150, 250 . . . , 1450}. The log-likelihood for a model M is therefore 

lnPr(d|M) = -±£ [C% C%{M)] C^,(M) [d% - Cf,(M)] T + In |C(M)| - ln|C fid |, (4.1) 

B,B' 

where C 1 * and C*(M) are the measured and predicted powers in band B, respectively, C is the 
bandpower covariance matrix, and we have chosen to normalize by subtracting off the determinant of 
the covariance matrix assuming a fiducial ACDM cosmology. 

As described in Ref. [41], the covariance matrix is constructed from four components: 

• the raw bandpower covariance, C raw , estimated from 2000 lensed CMB simulations of one of 
the SPT fields using a fiducial ACDM cosmology; 

• the lensing-potential bandpowers, Cg fid , of this fiducial cosmology; 

• the so-called zeroth-order bias, \ sourced by confusion between the effects of lensing and 
uncertainty in the primordial CMB signal imprinted at last scattering, and estimated by per- 
forming the lensing reconstruction on unlensed CMB simulations, and; 



additional uncertainty, C cal , induced by the SPT temperature calibration: all four SPT fields 
are rescaled (with some small error) such that the temperature power spectrum in the range 
1200 < t < 3000 matches that of the fiducial ACDM cosmology. 



{C% (M) + Ng^ ) I [C% fid + Ng^ ) to account for changes in sample variance. The resulting 



When sampling different cosmologies, the diagonal of the raw covariance matrix is first rescaled by a 

r 1 2 

factor of (C%(M)+N { °' 

matrix is then added to the calibration-induced uncertainty to produce C(M). 

The Fortran version of the SPT lensing likelihood used in this work can be downloaded from http : 
//zuserver2 . star . ucl . ac . uk/~smf /spt_lensing_likelihood . tar . gz. 
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